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Figure 4.1 You can use probability and discrete random variables to calculate the likelihood of lightning striking the 
ground five times during a half-hour thunderstorm. (Credit: Leszek Leszczynski) 



Introduction 



Chapter Objectives 



By the end of this chapter, the student should be able to: 

• Recognize and understand discrete probability distribution functions, in general. 

• Calculate and interpret expected values. 

• Recognize the binomial probability distribution and apply it appropriately. 

• Recognize the Poisson probability distribution and apply it appropriately. 

• Recognize the geometric probability distribution and apply it appropriately. 

• Recognize the hypergeometric probability distribution and apply it appropriately. 

• Classify discrete word problems by their distributions. 



A student takes a ten-question, true-false quiz. Because the student had such a busy schedule, he or she could not study and 
guesses randomly at each answer. What is the probability of the student passing the test with at least a 70%? 

Small companies might be interested in the number of long-distance phone calls their employees make during the peak time 
of the day. Suppose the average is 20 calls. What is the probability that the employees make more than 20 long-distance 
phone calls during the peak time? 
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These two examples illustrate two different types of probability problems involving discrete random variables. Recall that 
discrete data are data that you can count. A random variable describes the outcomes of a statistical experiment in words. 
The values of a random variable can vary with each repetition of an experiment. 

Random Variable Notation 

Upper case letters such as X or Y denote a random variable. Lower case letters like x or y denote the value of a random 
variable. If A is a random variable, then X is written in words, and x is given as a number. 

For example, let X = the number of heads you get when you toss three fair coins. The sample space for the toss of three fair 
coins is TTT; THH; HTH ; HHT; HTT; THT; TTH ; HHH. Then, x = 0, 1, 2, 3. X is in words and x is a number. Notice that 
for this example, the x values are countable outcomes. Because you can count the possible values that X can take on and the 
outcomes are random (the x values 0, 1, 2, 3), X is a discrete random variable. 




ollaborative Exercise 



Toss a coin ten times and record the number of heads. After all members of the class have completed the experiment 
(tossed a coin ten times and counted the number of heads), fill in Table 4.1. Let X = the number of heads in ten tosses 
of the coin. 



X 


Frequency of x 


Relative Frequency of x 







































Table 4.1 



a. Which value(s) of x occurred most frequently? 

b. If you tossed the coin 1,000 times, what values could x take on? Which value(s) of x do you think would occur 
most frequently? 

c. What does the relative frequency column sum to? 



4.1 1 Probability Distribution Function (PDF) for a Discrete 
Random Variable 

A discrete probability distribution function has two characteristics: 

1. Each probability is between zero and one, inclusive. 

2. The sum of the probabilities is one. 



Example 4.1 



A child psychologist is interested in the number of times a newborn baby's crying wakes its mother after midnight. 
For a random sample of 50 mothers, the following information was obtained. Let X = the number of times per 
week a newborn baby's crying wakes its mother after midnight. For this example, x = 0, 1, 2, 3, 4, 5. 

P(x) = probability that X takes on a value x. 
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X 


P(x) 


0 


II 

/- — \ 

o 

II 

ST 


1 


P <"=1)=35 


2 


P(x = 2)= 50 


3 




4 


II 

II 

ST 


5 


p <* = 5 >= 35 



Table 4.2 



X takes on the values 0, 1, 2, 3, 4, 5. This is a discrete PDF because: 

a. Each P(x) is between zero and one, inclusive. 

b. The sum of the probabilities is one, that is, 



J_ + Ii + 23 + J_ + A + X=i 

50 50 50 50 50 50 



Try It 



4.1 A hospital researcher is interested in the number of times the average post-op patient will ring the nurse during 
a 12-hour shift. For a random sample of 50 patients, the following information was obtained. Let X = the number of 
times a patient rings the nurse during a 12-hour shift. For this exercise, x = 0, 1, 2, 3, 4, 5. P(x) = the probability that X 
takes on value x. Why is this a discrete probability distribution function (two reasons)? 



X 


PM 


0 


P(X=0)=4y 


1 


P(x = l)=A 


2 


P(x = 2)=if 


3 


P(x = 3)=M 


4 


P(x = 4)=|y 


5 


P(x = 5)= A 



Table 4.3 
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Example 4.2 



Suppose Nancy has classes three days a week. She attends classes three days a week 80% of the time, two days 
15% of the time, one day 4% of the time, and no days 1% of the time. Suppose one week is randomly selected. 

a. Let X = the number of days Nancy . 

Solution 4.2 

a. Let X = the number of days Nancy attends class per week. 

b. X takes on what values? 

Solution 4.2 

b. 0, 1, 2, and 3 

c. Suppose one week is randomly chosen. Construct a probability distribution table (called a PDF table) like the 
one in Example 4.1. The table should have two columns labeled x and P(x). What does the P(x) column sum to? 

Solution 4.2 

c. 



X 


P(x) 


0 


0.01 


1 


0.04 


2 


0.15 


3 


0.80 



Table 4.4 



4.2 Jeremiah has basketball practice two days a week. Ninety percent of the time, he attends both practices. Eight 
percent of the time, he attends one practice. Two percent of the time, he does not attend either practice. What is X and 
what values does it take on? 



4.2 | Mean or Expected Value and Standard Deviation 

The expected value is often referred to as the "long-term" average or mean. This means that over the long term of doing 
an experiment over and over, you would expect this average. 

You toss a coin and record the result. What is the probability that the result is heads? If you flip a coin two times, does 
probability tell you that these flips will result in one heads and one tail? You might toss a fair coin ten times and record 
nine heads. As you learned in Section 3., probability does not describe the short-term results of an experiment, ft gives 
information about what can be expected in the long term. To demonstrate this, Karl Pearson once tossed a fair coin 24,000 
times! He recorded the results of each toss, obtaining heads 12,012 times. In his experiment, Pearson illustrated the Law 
of Large Numbers. 

The Law of Large Numbers states that, as the number of trials in a probability experiment increases, the difference 
between the theoretical probability of an event and the relative frequency approaches zero (the theoretical probability and 
the relative frequency get closer and closer together). When evaluating the long-term results of statistical experiments, 
we often want to know the “average” outcome. This “long-term average” is known as the mean or expected value of the 
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experiment and is denoted by the Greek letter p. In other words, after conducting many trials of an experiment, you would 
expect this average value. 

NOTE 

To find the expected value or long term average, p, simply multiply each value of the random variable by its probability 
and add the products. 



Example 4.3 



A men's soccer team plays soccer zero, one, or two days a week. The probability that they play zero days is 0.2, 
the probability that they play one day is 0.5, and the probability that they play two days is 0.3. Find the long-term 
average or expected value, p, of the number of days per week the men's soccer team plays soccer. 

To do the problem, first let the random variable X = the number of days the men's soccer team plays soccer per 
week. X takes on the values 0, 1, 2. Construct a PDF table adding a column x*P(x). In this column, you will 
multiply each x value by its probability. 



X 


P(x) 


x*P(x) 


0 


0.2 


(0)(0.2) = 0 


1 


0.5 


(1)(0.5) = 0.5 


2 


0.3 


(2)(0.3) = 0.6 



Table 4.5 Expected Value 
Table This table is called 
an expected value table. 
The table helps you 
calculate the expected 
value or long-term 
average. 



Add the last column x*P(x ) to find the long term average or expected value: (0)(0.2) + (1)(0.5) + (2)(0.3) = 0 + 
0.5 + 0.6 = 1.1. 

The expected value is 1.1. The men's soccer team would, on the average, expect to play soccer 1.1 days per week. 
The number 1.1 is the long-term average or expected value if the men's soccer team plays soccer week after week 
after week. We say p = 1.1. 



Example 4.4 



Find the expected value of the number of times a newborn baby's crying wakes its mother after midnight. 
The expected value is the expected number of times per week a newborn baby's crying wakes its mother after 
midnight. Calculate the standard deviation of the variable as well. 



X 


P(x) 


X*P(X) 


(X-J i/) 2 -P(x) 


0 


P(* = 0)=i 


(0) 


(A) 


= 0 


(0 - 2.1) 2 • 0.04 = 0.1764 


1 




(1) 


(A) 


= 11 
50 


(1-2.1) 2 • 0.22 = 0.2662 



Table 4.6 You expect a newborn to wake its mother after 
midnight 2.1 times per week, on the average. 
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X 


P(x) 


x*P(x) 


(x -1 u) 2 -P(x) 


2 


p(x = 2)= 50 




(2 - 2.1) 2 • 0.46 = 0.0046 


3 


P(* = 3)= A 




(3-2.1) 2 • 0.18 = 0.1458 


4 


II 

II 


(4) ) = — 

50 


(4-2.1) 2 • 0.08 = 0.2888 


5 




(5) (i) = 50 


(5-2.1) 2 • 0.02 = 0.1682 



Table 4.6 You expect a newborn to wake its mother after 
midnight 2.1 times per week, on the average. 



Add the values in the third column of the table to find the expected value of X : 

\i = Expected Value = =2.1 

Use p to complete the table. The fourth column of this table will provide the values you need to calculate the 
standard deviation. For each value x, multiply the square of its deviation by its probability. (Each deviation has 
the format x- if). 

Add the values in the fourth column of the table: 

0.1764 + 0.2662 + 0.0046 + 0.1458 + 0.2888 + 0.1682 = 1.05 

The standard deviation of X is the square root of this sum: a = Vl .05 ~ 1.0247 






Try It 



4.4 A hospital researcher is interested in the number of times the average post-op patient will ring the nurse during 
a 12-hour shift. For a random sample of 50 patients, the following information was obtained. What is the expected 
value? 



X 


P(x) 


0 


II 

o 

II 

CL 


1 


P(x = l)=A 


2 


P(x = 2)=f 


3 


P<x = 3)=f 


4 


P(x = 4)=A 


5 


P(x = 5)=A 



Table 4.7 
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Example 4.5 



Suppose you play a game of chance in which five numbers are chosen from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. A computer 
randomly selects five numbers from zero to nine with replacement. You pay $2 to play and could profit $100,000 
if you match all five numbers in order (you get your $2 back plus $100,000). Over the long term, what is your 
expected profit of playing the game? 

To do this problem, set up an expected value table for the amount of money you can profit. 

Let X = the amount of money you profit. The values of x are not 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. Since you are interested 
in your profit (or loss), the values of x are 100,000 dollars and -2 dollars. 

To win, you must get all five numbers correct, in order. The probability of choosing one correct number is -yL 
because there are ten numbers. You may choose a number more than once. The probability of choosing all five 
numbers correctly and in order is 

(toXtoXtoXtoXtc) = = °- ooo°i. 

Therefore, the probability of winning is 0.00001 and the probability of losing is 

1 - 0.00001 = 0 . 99999 . 



The expected value table is as follows: 





X 


P(x) 


x*P(x) 


Loss 


-2 


0.99999 


(-2)(0. 99999) = -1.99998 


Profit 


100,000 


0.00001 


(100000)(0.00001) = 1 



Table 4.8 Add the last column. -1.99998 + 1 = 
-0.99998 



Since -0.99998 is about -1, you would, on average, expect to lose approximately $1 for each game you play. 
However, each time you play, you either lose $2 or profit $100,000. The $1 is the average or expected LOSS per 
game after playing this game over and over. 




4.5 You are playing a game of chance in which four cards are drawn from a standard deck of 52 cards. You guess the 
suit of each card before it is drawn. The cards are replaced in the deck on each draw. You pay $1 to play. If you guess 
the right suit every time, you get your money back and $256. What is your expected profit of playing the game over 
the long term? 



Example 4.6 



Suppose you play a game with a biased coin. You play each game by tossing the coin once. P(heads) = ^ and 

P(tails) = y . If you toss a head, you pay $6. If you toss a tail, you win $10. If you play this game many times, 
will you come out ahead? 

a. Define a random variable X. 

Solution 4.6 





U<|K> 
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a. X = amount of profit 

b. Complete the following expected value table. 





X 






WIN 


10 


1 

3 




LOSE 






-12 

3 



Table 4.9 



Solution 4.6 

b. 





X 


P(x) 


xP(x) 






1 


10 


WIN 


10 


3 


3 






2 


-12 


LOSE 


-6 


3 


3 



Table 4.10 



c. What is the expected value, p? Do you come out ahead? 

Solution 4.6 

c. Add the last column of the table. The expected value p = -j 1 - . You lose, on average, about 67 cents each time 
you play the game so you do not come out ahead. 



4.6 Suppose you play a game with a spinner. You play each game by spinning the spinner once. P(red) = ^ , P(blue) 

, and P(green) = . If you land on red, you pay $10. If you land on blue, you don't pay or win anything. If you 

land on green, you win $10. Complete the following expected value table. 





X 


P(x) 




Red 






20 

5 


Blue 




2 

5 




Green 


10 







Table 4.11 



This content is available for free at https://cnx.Org/content/colll562/l.17 




CHAPTER 4 | DISCRETE RANDOM VARIABLES 235 



Like data, probability distributions have standard deviations. To calculate the standard deviation (a) of a probability 
distribution, find each deviation from its expected value, square it, multiply it by its probability, add the products, and take 
the square root. To understand how to do the calculation, look at the table for the number of days per week a men's soccer 
team plays soccer. To find the standard deviation, add the entries in the column labeled (x - p) P(x) and take the square root. 



X 


PM 


x*P(x) 


(x - p) 2 P(x) 


0 


0.2 


(0)(0.2) = 0 


(0-l.l) 2 (0.2) = 0.242 


1 


0.5 


(1)(0.5) = 0.5 


(1-1.1) 2 (0.5) = 0.005 


2 


0.3 


(2)(0.3) = 0.6 


(2 - 1.1) 2 (0.3) = 0.243 



Table 4.12 



Add the last column in the table. 0.242 + 0.005 + 0.243 = 0.490. The standard deviation is the square root of 0.49, or a = 
V0.49 = 0.7 

Generally for probability distributions, we use a calculator or a computer to calculate p and cr to reduce roundoff error. For 
some probability distributions, there are short-cut formulas for calculating p and a. 



Example 4.7 



Toss a fair, six-sided die twice. Let X = the number of faces that show an even number. Construct a table like 
Table 4.11 and calculate the mean p and standard deviation cr of X. 

Solution 4.7 

Tossing one fair six-sided die twice has the same sample space as tossing two fair six-sided dice. The sample 
space has 36 outcomes: 



(1, 1) 


(1, 2) 


(1, 3) 


(1, 4) 


(1, 5) 


(1,6) 


(2, 1) 


(2, 2) 


(2, 3) 


(2. 4) 


(2, 5) 


(2,6) 


(3, 1) 


(3, 2) 


(3, 3) 


(3. 4) 


(3, 5) 


(3,6) 


(4. 1) 


(4. 2) 


(4, 3) 


(4, 4) 


(4. 5) 


(4,6) 


(5, 1) 


(5, 2) 


(5, 3) 


(5. 4) 


(5, 5) 


(5,6) 


(6, 1) 


(6, 2) 


(6, 3) 


(6. 4) 


(6, 5) 


(6,6) 



Table 4.13 



Use the sample space to complete the following table: 



X 


P(x) 


xP(x) 


(X-j U ) 2 • P(x) 


0 


9 

36 


0 


(0 - 1)2 -|r & 


1 


18 

36 


18 

36 


d-i) 2 -f =o 




9 


18 




Z 


36 


36 



Table 4.14 Calculating p and a. 
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o/r 

Add the values in the third column to find the expected value: /j = ^ = 1. Use this value to complete the fourth 
column. 



/ 1 8 

Add the values in the fourth column and take the square root of the sum: a = [4^ ~ 0.7071. 



Example 4.8 



On May 11, 2013 at 9:30 PM, the probability that moderate seismic activity (one moderate earthquake) would 
occur in the next 48 hours in Iran was about 21.42%. Suppose you make a bet that a moderate earthquake will 
occur in Iran during this period. If you win the bet, you win $50. If you lose the bet, you pay $20. Let X = the 
amount of profit from a bet. 

P(win) = P(one moderate earthquake will occur) = 21.42% 

P(loss) = P(one moderate earthquake will not occur) = 100% - 21.42% 

If you bet many times, will you come out ahead? Explain your answer in a complete sentence using numbers. 
What is the standard deviation of XI Construct a table similar to Table 4.12 and Table 4.12 to help you answer 
these questions. 

Solution 4.8 





X 


P(x) 


x(Px) 


(X-J i/) 2 P(x) 


win 


50 


0.2142 


10.71 


[50 - (-5.006)] 2 (0.2142) = 648.0964 


loss 


-20 


0.7858 


-15.716 


[-20 - (-5.006)] 2 (0.7858) = 176.6636 



Table 4.15 



Mean = Expected Value = 10.71 + (-15.716) = -5.006. 

If you make this bet many times under the same conditions, your long term outcome will be an average loss of 
$5.01 per bet. 

Standard Deviation = V648.0964 + 176.6636 « 28.7186 



Try It 



4.8 On May 11, 2013 at 9:30 PM, the probability that moderate seismic activity (one moderate earthquake) would 
occur in the next 48 hours in Japan was about 1.08%. As in Example 4.8, you bet that a moderate earthquake will 
occur in Japan during this period. If you win the bet, you win $100. If you lose the bet, you pay $10. Let X = the 
amount of profit from a bet. Find the mean and standard deviation of X. 



Some of the more common discrete probability functions are binomial, geometric, hypergeometric, and Poisson. Most 
elementary courses do not cover the geometric, hypergeometric, and Poisson. Your instructor will let you know if he or she 
wishes to cover these distributions. 

A probability distribution function is a pattern. You try to fit a probability problem into a pattern or distribution in order 
to perform the necessary calculations. These distributions are tools to make solving probability problems easier. Each 
distribution has its own special characteristics. Learning the characteristics enables you to distinguish among the different 
distributions. 
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4.3 | Binomial Distribution 

There are three characteristics of a binomial experiment. 

1. There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of 
trials. 

2. There are only two possible outcomes, called "success" and "failure," for each trial. The letter p denotes the probability 
of a success on one trial, and q denotes the probability of a failure on one trial, p + q = 1. 

3. The n trials are independent and are repeated using identical conditions. Because the n trials are independent, the 
outcome of one trial does not help in predicting the outcome of another trial. Another way of saying this is that for each 
individual trial, the probability, p, of a success and probability, q, of a failure remain the same. For example, randomly 
guessing at a true-false statistics question has only two outcomes. If a success is guessing correctly, then a failure is 
guessing incorrectly. Suppose Joe always guesses correctly on any statistics true-false question with probability p = 
0.6. Then, q = 0.4. This means that for every true-false statistics question Joe answers, his probability of success (p = 
0.6) and his probability of failure (q = 0.4) remain the same. 

The outcomes of a binomial experiment fit a binomial probability distribution. The random variable X = the number of 
successes obtained in the n independent trials. 

The mean, p, and variance, a 2 , for the binomial probability distribution are p = np and cr 2 = npq. The standard deviation, a, 
is then a = y npq . 

Any experiment that has characteristics two and three and where n = 1 is called a Bernoulli Trial (named after Jacob 
Bernoulli who, in the late 1600s, studied them extensively). A binomial experiment takes place when the number of 
successes is counted in one or more Bernoulli Trials. 
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Example 4.9 



At ABC College, the withdrawal rate from an elementary physics course is 30% for any given term. This implies 
that, for any given term, 70% of the students stay in the class for the entire term. A "success" could be defined as 
an individual who withdrew. The random variable X = the number of students who withdraw from the randomly 
selected elementary physics class. 



Try It 




4.9 The state health board is concerned about the amount of fruit available in school lunches. Forty-eight percent of 
schools in the state offer fruit in their lunches every day. This implies that 52% do not. What would a "success" be in 
this case? 



Example 4.10 



Suppose you play a game that you can only either win or lose. The probability that you win any game is 55%, 
and the probability that you lose is 45%. Each game you play is independent. If you play the game 20 times, write 
the function that describes the probability that you win 15 of the 20 times. Here, if you define X as the number of 
wins, then X takes on the values 0, 1, 2, 3, ..., 20. The probability of a success is p = 0.55. The probability of a 
failure is q = 0.45. The number of trials is n = 20. The probability question can be stated mathematically as P(x = 
15). 



Try It 



4.10 A trainer is teaching a dolphin to do tricks. The probability that the dolphin successfully performs the trick is 
35%, and the probability that the dolphin does not successfully perform the trick is 65%. Out of 20 attempts, you want 
to find the probability that the dolphin succeeds 12 times. State the probability question mathematically. 



Example 4.11 



A fair coin is flipped 15 times. Each flip is independent. What is the probability of getting more than ten heads? 
Let X = the number of heads in 15 flips of the fair coin. X takes on the values 0, 1, 2, 3, ..., 15. Since the coin is 
fair, p = 0.5 and q = 0.5. The number of trials is n = 15. State the probability question mathematically. 

Solution 4.11 

P(x > 10) 




4.11 A fair, six-sided die is rolled ten times. Each roll is independent. You want to find the probability of rolling a one 
more than three times. State the probability question mathematically. 



Example 4.12 



Approximately 70% of statistics students do their homework in time for it to be collected and graded. Each 
student does homework independendy. In a statistics class of 50 students, what is the probability that at least 40 
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will do their homework on time? Students are selected randomly. 

a. This is a binomial problem because there is only a success or a , there are a fixed number of trials, 

and the probability of a success is 0.70 for each trial. 

Solution 4.12 

a. failure 



b. If we are interested in the number of students who do their homework on time, then how do we define X ? 

Solution 4.12 

b. X = the number of statistics students who do their homework on time 



c. What values does x take on? 

Solution 4.12 

c. 0, 1, 2, ..., 50 

d. What is a "failure," in words? 

Solution 4.12 

d. Failure is defined as a student who does not complete his or her homework on time. 

The probability of a success is p = 0.70. The number of trials is n = 50. 

e. If p + q = 1, then what is q? 

Solution 4.12 

e. q = 0.30 

f. The words "at least" translate as what kind of inequality for the probability question P(x 40). 

Solution 4.12 

f. greater than or equal to (>) 

The probability question is P(x > 40). 



Try It 



4.12 Sixty-five percent of people pass the state driver’s exam on the first try. A group of 50 individuals who have 
taken the driver’s exam is randomly selected. Give two reasons why this is a binomial problem. 



Notation for the Binomial: B = Binomial Probability Distribution Function 

X ~ B(n, p) 

Read this as "X is a random variable with a binomial distribution." The parameters are n and p; n = number of trials, p = 
probability of a success on each trial. 



Example 4.13 



It has been stated that about 41% of adult workers have a high school diploma but do not pursue any further 
education. If 20 adult workers are randomly selected, find the probability that at most 12 of them have a high 
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school diploma but do not pursue any further education. How many adult workers do you expect to have a high 
school diploma but do not pursue any further education? 

Let X = the number of workers who have a high school diploma but do not pursue any further education. 

X takes on the values 0, 1, 2, ..., 20 where n = 20, p = 0.41, and q = 1 - 0.41 = 0.59. X ~ B( 20, 0.41) 

Find P(x < 12). P(x < 12) = 0.9738. (calculator or computer) 




g the TI-83, 83 + , 89, 88+ Calculator 



Go into 2 nd DISTR. The syntax for the instructions are as follows: 

To calculate (x = value): binompdf(n, p, number) if "number" is left out, the result is the binomial 
probability table. 

To calculate P(x < value): binomcdffn, p, number) if "number" is left out, the result is the cumulative 
binomial probability table. 

For this problem: After you are in 2 nd DISTR, arrow down to binomcdf. Press ENTER. Enter 
20,0.41,12). The result is P(x < 12) = 0.9738. 



NOTE 



If you want to find P(x = 12), use the pdf (binompdf). If you want to find P(x > 12), use 1 - 
binomcdf(20, 0.41,12). 



The probability that at most 12 workers have a high school diploma but do not pursue any further education is 
0.9738. 

The graph of X ~ B( 20, 0.41) is as follows: 




The y-axis contains the probability of x, where X = the number of workers who have only a high school diploma. 

The number of adult workers that you expect to have a high school diploma but not pursue any further education 
is the mean, p = np = (20)(0.41) = 8.2. 

2 

The formula for the variance is a = npq. The standard deviation is cr = \jnpq . 
a= V(20)(0.41)(0.59) =2.20. 
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4.13 About 32% of students participate in a community volunteer program outside of school. If 30 students are 
selected at random, find the probability that at most 14 of them participate in a community volunteer program outside 
of school. Use the TI-83+ or TI-84 calculator to find the answer. 



Example 4.14 



In the 2013 Jerry’s Artarama art supplies catalog, there are 560 pages. Eight of the pages feature signature artists. 
Suppose we randomly sample 100 pages. Let X = the number of pages that feature signature artists. 

a. What values does x take on? 

b. What is the probability distribution? Find the following probabilities: 

i. the probability that two pages feature signature artists 

ii. the probability that at most six pages feature signature artists 

iii. the probability that more than three pages feature signature artists. 

c. Using the formulas, calculate the (i) mean and (ii) standard deviation. 

Solution 4.14 

a. x = 0, 1, 2, 3, 4, 5, 6, 7, 8 

b . *~B(ioo,jy 

i. P(x = 2) = binompdf ^100, 

ii. P(x < 6) = binomcdf { 100, 7777-, 

V 560 

iii. P{x > 3) = 1 - P(x < 3) = 1 - binomcdf (lOO, 3) = 1 - 0.9443 = 0.0557 

c. i. Mean = np- (100) (JL) = fg *1.4286 
ii. Standard Deviation = \ Inpq = ^(100)^j^j^52.j ~ 1.1867 



2) = 0.2466 
6) = 0.9994 




4.14 According to a Gallup poll, 60% of American adults prefer saving over spending. Let X = the number of 
American adults out of a random sample of 50 who prefer saving to spending. 

a. What is the probability distribution for XI 

b. Use your calculator to find the following probabilities: 

i. the probability that 25 adults in the sample prefer saving over spending 

ii. the probability that at most 20 adults prefer saving 

iii. the probability that more than 30 adults prefer saving 

c. Using the formulas, calculate the (i) mean and (ii) standard deviation of X. 
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Example 4.15 



The lifetime risk of developing pancreatic cancer is about one in 78 (1.28%). Suppose we randomly sample 200 
people. Let X = the number of people who will develop pancreatic cancer. 

a. What is the probability distribution for X ? 

b. Using the formulas, calculate the (i) mean and (ii) standard deviation of X. 

c. Use your calculator to find the probability that at most eight people develop pancreatic cancer 

d. Is it more likely that five or six people will develop pancreatic cancer? Justify your answer numerically. 

Solution 4.15 

a. X~ B( 200, 0.0128) 

b. i. Mean = np = 200(0.0128) = 2.56 

ii. Standard Deviation = jripq = V(200)(0.0128)(0.9872) « 1.5897 

c. Using the Tl-83, 83+, 84 calculator with instructions as provided in Example 4.13: 

P(x < 8) = binomcdf(200, 0.0128, 8) = 0.9988 

d. P(x = 5) = binompdf(200, 0.0128, 5) = 0.0707 
P(x = 6) = binompdf(200, 0.0128, 6) = 0.0298 

So P(x = 5) > P(x = 6); it is more likely that five people will develop cancer than six. 



Try It 



4.15 During the 2013 regular NBA season, DeAndre Jordan of the Los Angeles Clippers had the highest field goal 
completion rate in the league. DeAndre scored with 61.3% of his shots. Suppose you choose a random sample of 80 
shots made by DeAndre during the 2013 season. Let X = the number of shots that scored points. 

a. What is the probability distribution for XI 

b. Using the formulas, calculate the (i) mean and (ii) standard deviation of X. 

c. Use your calculator to find the probability that DeAndre scored with 60 of these shots. 

d. Find the probability that DeAndre scored with more than 50 of these shots. 



Example 4.16 



The following example illustrates a problem that is not binomial, ft violates the condition of independence. ABC 
College has a student advisory committee made up of ten staff members and six students. The committee wishes 
to choose a chairperson and a recorder. What is the probability that the chairperson and recorder are both students? 
The names of all committee members are put into a box, and two names are drawn without replacement. The 
first name drawn determines the chairperson and the second name the recorder. There are two trials. However, 
the trials are not independent because the outcome of the first trial affects the outcome of the second trial. The 

probability of a student on the first draw is . The probability of a student on the second draw is , when the 



first draw selects a student. The probability is 



_ 6 _ 

15 



, when the first draw selects a staff member. The probability of 



drawing a student's name changes for each of the trials and, therefore, violates the condition of independence. 




4.16 A lacrosse team is selecting a captain. The names of all the seniors are put into a hat, and the first three that are 
drawn will be the captains. The names are not replaced once they are drawn (one person cannot be two captains). You 
want to see if the captains all play the same position. State whether this is binomial or not and state why. 
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4.4 | Geometric Distribution 

There are three main characteristics of a geometric experiment. 

1. There are one or more Bernoulli trials with all failures except the last one, which is a success. In other words, you keep 
repeating what you are doing until the first success. Then you stop. For example, you throw a dart at a bullseye until 
you hit the bullseye. The first time you hit the bullseye is a "success" so you stop throwing the dart, ft might take six 
tries until you hit the bullseye. You can think of the trials as failure, failure, failure, failure, failure, success, STOP. 

2. In theory, the number of trials could go on forever. There must be at least one trial. 

3. The probability, p, of a success and the probability, q, of a failure is the same for each trial, p + q = 1 and q = 1 - p. 
For example, the probability of rolling a three when you throw one fair die is ^ . This is true no matter how many 

times you roll the die. Suppose you want to know the probability of getting the first three on the fifth roll. On rolls 
one through four, you do not get a face with a three. The probability for each of the rolls is q = ^ , the probability of a 

failure. The probability of getting a three on the fifth roll is (g‘)(g)(g)(g)(g) = 0-0804 

X = the number of independent trials until the first success. 



244 CHAPTER 4 | DISCRETE RANDOM VARIABLES 



Example 4.17 



You play a game of chance that you can either win or lose (there are no other possibilities) until you lose. Your 
probability of losing is p = 0.57. What is the probability that it takes five games until you lose? Let X = the number 
of games you play until you lose (includes the losing game). Then X takes on the values 1, 2, 3, ... (could go on 
indefinitely). The probability question is P(x = 5). 



Try It 



4.17 You throw darts at a board until you hit the center area. Your probability of hitting the center area is p = 0.17. 
You want to find the probability that it takes eight throws until you hit the center. What values does X take on? 



Example 4.18 



A safety engineer feels that 35% of all industrial accidents in her plant are caused by failure of employees to 
follow instructions. She decides to look at the accident reports (selected randomly and replaced in the pile after 
reading) until she finds one that shows an accident caused by failure of employees to follow instructions. On 
average, how many reports would the safety engineer expect to look at until she finds a report showing an 
accident caused by employee failure to follow instructions? What is the probability that the safety engineer will 
have to examine at least three reports until she finds a report showing an accident caused by employee failure to 
follow instructions? 

Let X = the number of accidents the safety engineer must examine until she finds a report showing an accident 
caused by employee failure to follow instructions. X takes on the values 1, 2, 3, .... The first question asks you 
to find the expected value or the mean. The second question asks you to find P(x > 3). ("At least" translates to a 
"greater than or equal to" symbol). 



Try It T* 




4.18 An instructor feels that 15% of students get below a C on their final exam. She decides to look at final exams 
(selected randomly and replaced in the pile after reading) until she finds one that shows a grade below a C. We want to 
know the probability that the instructor will have to examine at least ten exams until she finds one with a grade below 
a C. What is the probability question stated mathematically? 



Example 4.19 



Suppose that you are looking for a student at your college who lives within five miles of you. You know that 55% 
of the 25,000 students do live within five miles of you. You randomly contact students from the college until one 
says he or she lives within five miles of you. What is the probability that you need to contact four people? 

This is a geometric problem because you may have a number of failures before you have the one success you 
desire. Also, the probability of a success stays the same each time you ask a student if he or she lives within five 
miles of you. There is no definite number of trials (number of times you ask a student). 

a. Let X = the number of you must ask one says yes. 

Solution 4.19 

a. Let X = the number of students you must ask until one says yes. 



b. What values does X take on? 

Solution 4.19 
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b. 1, 2, 3, (total number of students) 

c. What are p and q ? 

Solution 4.19 

c. p = 0.55; q = 0.45 

d. The probability question is P( ). 

Solution 4.19 

d. P(x = 4) 



Try It T* 




* 

v 



4.19 You need to find a store that carries a special printer ink. You know that of the stores that carry printer ink, 10% 
of them carry the special ink. You randomly call each store until one has the ink you need. What are p and q? 



Notation for the Geometric: G = Geometric Probability Distribution Function 

X~G(p ) 

Read this as "X is a random variable with a geometric distribution." The parameter is p; p = the probability of a success 
for each trial. 



Example 4.20 



Assume that the probability of a defective computer component is 0.02. Components are randomly selected. Find 
the probability that the first defect is caused by the seventh component tested. How many components do you 
expect to test until one is found to be defective? 

Let X = the number of computer components tested until the first defect is found. 

X takes on the values 1, 2, 3, ... where p = 0.02. X ~ G(0.02) 

Find P(x = 7). P(x = 7) = 0.0177. 







the 



TI-83, 83 + , 83, 83+ Calculator 



To find the probability that x-7 , 

• Enter 2 nd , D1STR 

• Scroll down and select geometpdf( 

• Press ENTER 

• Enter 0.02, 7); press ENTER to see the result: P(x = 7) = 0.0177 



To find the probability that x <7, follow the same instructions EXCEPT select E:geometcdf(as the distribution 
function. 

The probability that the seventh component is the first defect is 0.0177. 

The graph of X ~ G(0.02) is: 
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Figure 4.3 

The y-axis contains the probability of x, where X = the number of computer components tested. 

The number of components that you would expect to test until you find the first defective one is the mean, 
/j = 50 . 

The formula for the mean is p = ^ ^ = 50 

The formula for the variance is o 2 = — l) = ( q 02 )( 0 02 — l) = ^>450 

The standard deviation is a = '|/(1)(1 - l) = " ! ) = 49 - 5 



Try It 



4.20 The probability of a defective steel rod is 0.01. Steel rods are selected at random. Find the probability that 
the first defect occurs on the ninth steel rod. Use the T1-83+ or Tl-84 calculator to find the answer. 



Example 4.21 



The lifetime risk of developing pancreatic cancer is about one in 78 (1.28%). Let X = the number of people 
you ask until one says he or she has pancreatic cancer. Then X is a discrete random variable with a geometric 

j orX~G(0.0128). 

a. What is the probability of that you ask ten people before one says he or she has pancreatic cancer? 

b. What is the probability that you must ask 20 people? 

c. Find the (i) mean and (ii) standard deviation of X. 

Solution 4.21 

a. P(x = 10) = geometpdf(0.0128, 10) = 0.0114 

b. P(x = 20) = geometpdf(0.0128, 20) = 0.01 

c. i. Mean = p= 1 = ^28 =78 



distribution: X ~ G 



Mr 
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ii. 



Standard Deviation = a = 




1 - 0.0128 * 77 6234 
0 . 0128 2 




4.21 The literacy rate for a nation measures the proportion of people age 15 and over who can read and write. The 
literacy rate for women in Afghanistan is 12%. Let X = the number of Afghani women you ask until one says that she 
is literate. 

a. What is the probability distribution of X? 

b. What is the probability that you ask five women before one says she is literate? 

c. What is the probability that you must ask ten women? 

d. Find the (i) mean and (ii) standard deviation of X. 



4.5 | Hypergeometric Distribution 

There are five characteristics of a hypergeometric experiment. 

1. You take samples from two groups. 

2. You are concerned with a group of interest, called the first group. 

3. You sample without replacement from the combined groups. For example, you want to choose a softball team from 
a combined group of 11 men and 13 women. The team consists of ten players. 

4. Each pick is not independent, since sampling is without replacement. In the softball example, the probability of 

ia ii in 

picking a woman first is ^ . The probability of picking a man second is ^ if a woman was picked first, ft is if 

a man was picked first. The probability of the second pick depends on what happened in the first pick. 

5. You are not dealing with Bernoulli Trials. 

The outcomes of a hypergeometric experiment fit a hypergeometric probability distribution. The random variable X = the 
number of items from the group of interest. 
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Example 4.22 



A candy dish contains 100 jelly beans and 80 gumdrops. Fifty candies are picked at random. What is the 
probability that 35 of the 50 are gumdrops? The two groups are jelly beans and gumdrops. Since the probability 
question asks for the probability of picking gumdrops, the group of interest (first group) is gumdrops. The size of 
the group of interest (first group) is 80. The size of the second group is 100. The size of the sample is 50 (jelly 
beans or gumdrops). Let X = the number of gumdrops in the sample of 50. X takes on the values x = 0, 1, 2, ..., 
50. What is the probability statement written mathematically? 

Solution 4.22 

P(x = 35) 




4.22 A bag contains letter tiles. Forty-four of the tiles are vowels, and 56 are consonants. Seven tiles are picked at 
random. You want to know the probability that four of the seven tiles are vowels. What is the group of interest, the size 
of the group of interest, and the size of the sample? 



Example 4.23 



Suppose a shipment of 100 DVD players is known to have ten defective players. An inspector randomly chooses 
12 for inspection. Fie is interested in determining the probability that, among the 12 players, at most two are 
defective. The two groups are the 90 non-defective DVD players and the 10 defective DVD players. The group 
of interest (first group) is the defective group because the probability question asks for the probability of at most 
two defective DVD players. The size of the sample is 12 DVD players. (They may be non-defective or defective.) 
Let X = the number of defective DVD players in the sample of 12. X takes on the values 0, 1, 2, ..., 10. X may 
not take on the values 11 or 12. The sample size is 12, but there are only 10 defective DVD players. Write the 
probability statement mathematically. 

Solution 4.23 

P(x < 2) 



Try It 



4.23 A gross of eggs contains 144 eggs. A particular gross is known to have 12 cracked eggs. An inspector randomly 
chooses 15 for inspection. She wants to know the probability that, among the 15, at most three are cracked. What is X, 
and what values does it take on? 



Example 4.24 



You are president of an on-campus special events organization. You need a committee of seven students to plan a 
special birthday party for the president of the college. Your organization consists of 18 women and 15 men. You 
are interested in the number of men on your committee. If the members of the committee are randomly selected, 
what is the probability that your committee has more than four men? 

This is a hypergeometric problem because you are choosing your committee from two groups (men and women), 
a. Are you choosing with or without replacement? 

Solution 4.24 

a. without 
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b. What is the group of interest? 

Solution 4.24 

b. the men 



c. How many are in the group of interest? 

Solution 4.24 

c. 15 men 

d. How many are in the other group? 

Solution 4.24 

d. 18 women 

e. Let X = on the committee. What values does X take on? 

Solution 4.24 

e. Let X = the number of men on the committee, x = 0, 1, 2, . . ., 7. 

f. The probability question is P( ). 

Solution 4.24 

f. P(x > 4) 



Try It 




4.24 A palette has 200 milk cartons. Of the 200 cartons, it is known that ten of them have leaked and cannot be sold. 
A stock clerk randomly chooses 18 for inspection. He wants to know the probability that among the 18, no more than 
two are leaking. Give five reasons why this is a hypergeometric problem. 



Notation for the Hypergeometric: H = Hypergeometric Probability 
Distribution Function 

X ~ H(r, b, n ) 

Read this as "X is a random variable with a hypergeometric distribution." The parameters are r, b, and n; r = the size of the 
group of interest (first group), b = the size of the second group, n = the size of the chosen sample. 



Example 4.25 



A school site committee is to be chosen randomly from six men and five women. If the committee consists of 
four members chosen randomly, what is the probability that two of them are men? How many men do you expect 
to be on the committee? 

Let X = the number of men on the committee of four. The men are the group of interest (first group). 

X takes on the values 0, 1, 2, 3, 4, where r = 6, b = 5, and n = 4. X ~ H( 6, 5, 4) 

Find P(x = 2). P(x = 2) = 0.4545 (calculator or computer) 
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NOTE 

Currently, the TI-83+ and TI-84 do not have hypergeometric probability functions. There are a number 
of computer packages, including Microsoft Excel, that do. 



The probability that there are two men on the committee is about 0.45. 
The graph of X ~ H( 6, 5, 4) is: 




X 

Figure 4.4 

The y-axis contains the probability of X, where X = the number of men on the committee. 
You would expect m = 2.18 (about two) men on the committee. 

The formula for the mean is u = nr , = =2.18 

H r+b 6+5 



Try It t* 



4.25 An intramural basketball team is to be chosen randomly from 15 boys and 12 girls. The team has ten slots. You 
want to know the probability that eight of the players will be boys. What is the group of interest and the sample? 



4.6 | Poisson Distribution 

There are two main characteristics of a Poisson experiment. 

1. The Poisson probability distribution gives the probability of a number of events occurring in a fixed interval of 
time or space if these events happen with a known average rate and independently of the time since the last event. For 
example, a book editor might be interested in the number of words spelled incorrectly in a particular book. It might be 
that, on the average, there are five words spelled incorrectly in 100 pages. The interval is the 100 pages. 

2. The Poisson distribution may be used to approximate the binomial if the probability of success is "small" (such as 
0.01) and the number of trials is "large" (such as 1,000). You will verify the relationship in the homework exercises, n 
is the number of trials, and p is the probability of a "success." 

The random variable X = the number of occurrences in the interval of interest. 
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Example 4.26 



The average number of loaves of bread put on a shelf in a bakery in a half-hour period is 12. Of interest is the 
number of loaves of bread put on the shelf in five minutes. The time interval of interest is five minutes. What is 
the probability that the number of loaves, selected randomly, put on the shelf in five minutes is three? 

Let X = the number of loaves of bread put on the shelf in five minutes. If the average number of loaves put on 
the shelf in 30 minutes (half-hour) is 12, then the average number of loaves put on the shelf in five minutes is 

(^) (12) = 2 loaves of bread. 

The probability question asks you to find P(x = 3). 



4.26 The average number of fish caught in an hour is eight. Of interest is the number of fish caught in 15 minutes. 
The time interval of interest is 15 minutes. What is the average number of fish caught in 15 minutes? 



Example 4.27 



A bank expects to receive six bad checks per day, on average. What is the probability of the bank getting fewer 
than five bad checks on any given day? Of interest is the number of checks the bank receives in one day, so the 
time interval of interest is one day. Let X = the number of bad checks the bank receives in one day. If the bank 
expects to receive six bad checks per day then the average is six checks per day. Write a mathematical statement 
for the probability question. 

Solution 4.27 

P(x < 5) 



4.27 An electronics store expects to have ten returns per day on average. The manager wants to know the probability 
of the store getting fewer than eight returns on any given day. State the probability question mathematically. 



Example 4.28 



You notice that a news reporter says "uh," on average, two times per broadcast. What is the probability that the 
news reporter says "uh" more than two times per broadcast. 

This is a Poisson problem because you are interested in knowing the number of times the news reporter says "uh" 
during a broadcast. 

a. What is the interval of interest? 

Solution 4.28 

a. one broadcast 

b. What is the average number of times the news reporter says "uh" during one broadcast? 

Solution 4.28 

b. 2 
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c. Let X = . What values does X take on? 

Solution 4.28 

c. Let X = the number of times the news reporter says "uh" during one broadcast, 
x = 0, 1, 2, 3, ... 

d. The probability question is P( ). 

Solution 4.28 

d. P(x > 2) 



Try It 




4.28 An emergency room at a particular hospital gets an average of five patients per hour. A doctor wants to know the 
probability that the ER gets more than five patients per hour. Give the reason why this would be a Poisson distribution. 



Notation for the Poisson: P = Poisson Probability Distribution Function 

X~PQi) 

Read this as "X is a random variable with a Poisson distribution." The parameter is p (or A); p (or A) = the mean for the 
interval of interest. 



Example 4.29 



Leah's answering machine receives about six telephone calls between 8 a.m. and 10 a.m. What is the probability 
that Leah receives more than one call in the next 15 minutes? 

Let X = the number of calls Leah receives in 15 minutes. (The interval of interest is 15 minutes or 1 hour.) 



x = 0, 1, 2, 3, ... 

If Leah receives, on the average, six telephone calls in two hours, and there are eight 15 minute intervals in two 
hours, then Leah receives 

(g - ) (6) = 0.75 calls in 15 minutes, on average. So, p = 0.75 for this problem. 

X ~ P(0.75) 

Find P(x > 1). P(x > 1) = 0.1734 (calculator or computer) 



□ 



Using the TI-83, 83 + , 84, 84+ Calculator 



• Press 1 - and then press 2 nd DISTR. 

• Arrow down to poissoncdf. Press ENTER. 

• Enter (.75,1). 

• The result is P(x > 1) = 0.1734. 
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NOTE 

The TI calculators use X (lambda) for the mean. 



The probability that Leah receives more than one telephone call in the next 15 minutes is about 0.1734: 
P{x > 1) = 1 - poissoncdf(0.75, 1). 

The graph of X ~ P( 0.75) is: 




Figure 4.5 

They-axis contains the probability of x where X = the number of calls in 15 minutes. 



Try It 



4.29 A customer service center receives about ten emails every half-hour. What is the probability that the 
customer service center receives more than four emails in the next six minutes? Use the TI-83+ or TI-84 calculator to 
find the answer. 



Example 4.30 



According to Baydin, an email management company, an email user gets, on average, 147 emails per day. Let X 
= the number of emails an email user receives per day. The discrete random variable X takes on the values x = 0, 
1,2 .... The random variable X has a Poisson distribution: X ~ P(147). The mean is 147 emails. 

a. What is the probability that an email user receives exactly 160 emails per day? 

b. What is the probability that an email user receives at most 160 emails per day? 

c. What is the standard deviation? 

Solution 4.30 

a. P(x = 160) = poissonpdf(147, 160) ~ 0.0180 

b. P(x < 160) = poissoncdf(147, 160) » 0.8666 
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c. Standard Deviation = a = \[p. = V 147 « 12.1244 




4.30 According to a recent poll by the Pew Internet Project, girls between the ages of 14 and 17 send an average of 

187 text messages each day. Let X = the number of texts that a girl aged 14 to 17 sends per day. The discrete random 

variable X takes on the values x = 0,l,2 .... The random variable X has a Poisson distribution: X ~ P(187). The mean 
is 187 text messages. 

a. What is the probability that a teen girl sends exactly 175 texts per day? 

b. What is the probability that a teen girl sends at most 150 texts per day? 

c. What is the standard deviation? 



Example 4.31 



Text message users receive or send an average of 41.5 text messages per day. 

a. How many text messages does a text message user receive or send per hour? 

b. What is the probability that a text message user receives or sends two messages per hour? 

c. What is the probability that a text message user receives or sends more than two messages per hour? 

Solution 4.31 

a. Let X = the number of texts that a user sends or receives in one hour. The average number of texts received 
per hour is ~ 1-7292. 

b. X~ P( 1.7292), so P(x = 2) = poissonpdf(1.7292, 2) ~ 0.2653 

c. P(x > 2) = 1 - P(x < 2) = 1 - poissoncdf(1.7292, 2) » 1 - 0.7495 = 0.2505 



Try It 



4.31 Atlanta’s Hartsfield-Jackson International Airport is the busiest airport in the world. On average there are 2,500 
arrivals and departures each day. 

a. How many airplanes arrive and depart the airport per hour? 

b. What is the probability that there are exactly 100 arrivals and departures in one hour? 

c. What is the probability that there are at most 100 arrivals and departures in one hour? 



Example 4.32 



On May 13, 2013, starting at 4:30 PM, the probability of low seismic activity for the next 48 hours in Alaska 
was reported as about 1.02%. Use this information for the next 200 days to find the probability that there will be 
low seismic activity in ten of the next 200 days. Use both the binomial and Poisson distributions to calculate the 
probabilities. Are they close? 

Solution 4.32 

Let X = the number of days with low seismic activity. 

Using the binomial distribution: 
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• P(x = 10) = binompdf(200, .0102, 10) » 0.000039 
Using the Poisson distribution: 

• Calculate p = np = 200(0.0102) ~ 2.04 

• P(x = 10) = poissonpdf(2.04, 10) ~ 0.000045 

We expect the approximation to be good because n is large (greater than 20) and p is small (less than 0.05). The 
results are close — both probabilities reported are almost 0. 



Try It t* 



4.32 On May 13, 2013, starting at 4:30 PM, the probability of moderate seismic activity for the next 48 hours in the 
Kuril Islands off the coast of Japan was reported at about 1.43%. Use this information for the next 100 days to find 
the probability that there will be low seismic activity in five of the next 100 days. Use both the binomial and Poisson 
distributions to calculate the probabilities. Are they close? 



4.7 | Discrete Distribution (Playing Card Experiment) 
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4.1 Discrete Distribution (Playing Card Experiment) 

Class Time: 

Names: 

Student Learning Outcomes 

• The student will compare empirical data and a theoretical distribution to determine if an everyday experiment fits 
a discrete distribution. 

• The student will demonstrate an understanding of long-term probabilities. 

Supplies 

• One full deck of playing cards 

Procedure 

The experimental procedure is to pick one card from a deck of shuffled cards. 

1. The theoretical probability of picking a diamond from a deck is . 

2. Shuffle a deck of cards. 

3. Pick one card from it. 

4. Record whether it was a diamond or not a diamond. 

5. Put the card back and reshuffle. 

6. Do this a total of ten times. 

7. Record the number of diamonds picked. 

8. Let X = number of diamonds. Theoretically, X ~ B( , ) 

Organize the Data 

1. Record the number of diamonds picked for your class in Table 4.16. Then calculate the relative frequency. 



X 


Frequency 


Relative Frequency 


0 






1 






2 






3 






4 






5 






6 






7 






8 






9 






10 







Table 4.16 



2. Calculate the following: 
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a. X = 



b. s = 

3. Construct a histogram of the empirical data. 
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Figure 4.6 



Number of diamonds 



Theoretical Distribution 

a. Build the theoretical PDF chart based on the distribution in the Procedure section. 



b. Calculate the following: 

a- F= 



X 


P(x) 


0 




1 




2 




3 




4 




5 




6 




7 




8 




9 




10 





Table 4.17 



b. a = 

c. Construct a histogram of the theoretical distribution. 



Figure 4.7 



This is a blank graph template. The x-axis is labeled Number of diamonds. 
The y-axis is labeled Probability. 



Using the Data 
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NOTE 

RF = relative frequency 



Use the table from the Theoretical Distribution section to calculate the following answers. Round your answers to 
four decimal places. 

• P(x = 3)= 

• P(1 < x < 4) = 

• P(x> 8)= 

Use the data from the Organize the Data section to calculate the following answers. Round your answers to four 
decimal places. 

• RF(x = 3) = 

• RF( l<x<4) = 

• RF(x > 8) = 

Discussion Questions 

For questions 1 and 2, think about the shapes of the two graphs, the probabilities, the relative frequencies, the means, 
and the standard deviations. 

1. Knowing that data vary, describe three similarities between the graphs and distributions of the theoretical and 
empirical distributions. Use complete sentences. 

2. Describe the three most significant differences between the graphs or distributions of the theoretical and empirical 
distributions. 

3. Using your answers from questions 1 and 2, does it appear that the data fit the theoretical distribution? In complete 
sentences, explain why or why not. 

4. Suppose that the experiment had been repeated 500 times. Would you expect Table 4.16 or Table 4.17 to 
change, and how would it change? Why? Why wouldn’t the other table change? 



4.8 | Discrete Distribution (Lucky Dice Experiment) 
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4.2 Discrete Distribution (Lucky Dice Experiment) 

Class Time: 

Names: 

Student Learning Outcomes 

• The student will compare empirical data and a theoretical distribution to determine if a Tet gambling game fits a 
discrete distribution. 

• The student will demonstrate an understanding of long-term probabilities. 

Supplies 

• one “Lucky Dice” game or three regular dice 

Procedure 

Round answers to relative frequency and probability problems to four decimal places. 

1. The experimental procedure is to bet on one object. Then, roll three Lucky Dice and count the number of matches. 
The number of matches will decide your profit. 

2. What is the theoretical probability of one die matching the object? 

3. Choose one object to place a bet on. Roll the three Lucky Dice. Count the number of matches. 

4. Let X = number of matches. Theoretically, X ~ B( , ) 

5. Let Y= profit per game. 

Organize the Data 

In Table 4.18, fill in the y value that corresponds to each x value. Next, record the number of matches picked for your 
class. Then, calculate the relative frequency. 

1. Complete the table. 



X 


y 


Frequency 


Relative Frequency 


0 








1 








2 








3 









Table 4.18 



2. Calculate the following: 

a. x = 

b. Sx = 

c- y = 

d . Sy — 

3. Explain what x represents. 

4. Explain what y represents. 
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5. Based upon the experiment: 

a. What was the average profit per game? 

b. Did this represent an average win or loss per game? 

c. How do you know? Answer in complete sentences. 

6. Construct a histogram of the empirical data. 



c 

01 

3 

T 

0) 

H- 

Q) 

> 

15 

CD 

cr 



Number of matches 

Figure 4.8 



Theoretical Distribution 

Build the theoretical PDF chart for x and y based on the distribution from the Procedure section. 



X 


y 


P(x) = P(y) 


0 






1 






2 






3 







Table 4.19 



2. Calculate the following: 

a. p x = 

b. o x = 

c. px = 

3. Explain what p x represents. 

4. Explain what py represents. 

5. Based upon theory: 

a. What was the expected profit per game? 

b. Did the expected profit represent an average win or loss per game? 

c. How do you know? Answer in complete sentences. 

6. Construct a histogram of the theoretical distribution. 
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Figure 4.9 



Number of matches 



Use the Data 



NOTE 

RF = relative frequency 



Use the data from the Theoretical Distribution section to calculate the following answers. Round your answers to 
four decimal places. 

1. P(x = 3) = 

2. P(0<x<3) = 

3. P(x > 2) = 

Use the data from the Organize the Data section to calculate the following answers. Round your answers to four 
decimal places. 

1. RF(x = 3) = 

2. PF( 0<x<3) = 

3. RF(x > 2) = 

Discussion Question 

For questions 1 and 2, consider the graphs, the probabilities, the relative frequencies, the means, and the standard 
deviations. 

1. Knowing that data vary, describe three similarities between the graphs and distributions of the theoretical and 
empirical distributions. Use complete sentences. 

2. Describe the three most significant differences between the graphs or distributions of the theoretical and empirical 
distributions. 

3. Thinking about your answers to questions 1 and 2, does it appear that the data fit the theoretical distribution? In 
complete sentences, explain why or why not. 

4. Suppose that the experiment had been repeated 500 times. Would you expect Table 4.18 or Table 4.19 to 
change, and how would it change? Why? Why wouldn’t the other table change? 
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KEY TERMS 

Bernoulli Trials an experiment with the following characteristics: 

1. There are only two possible outcomes called “success” and “failure” for each trial. 

2. The probability p of a success is the same for any trial (so the probability q = 1 - p of a failure is the same for 
any trial). 

Binomial Experiment a statistical experiment that satisfies the following three conditions: 

1. There are a fixed number of trials, n. 

2. There are only two possible outcomes, called "success" and, "failure," for each trial. The letter p denotes the 
probability of a success on one trial, and q denotes the probability of a failure on one trial. 

3. The n trials are independent and are repeated using identical conditions. 

Binomial Probability Distribution a discrete random variable (RV) that arises from Bernoulli trials; there are a fixed 
number, n, of independent trials. “Independent” means that the result of any trial (for example, trial one) does not 
affect the results of the following trials, and all trials are conducted under the same conditions. Under these 
circumstances the binomial RV X is defined as the number of successes in n trials. The notation is: X ~ B(n, p). The 
mean is p = np and the standard deviation is cr = y npq . The probability of exactly x successes in n trials is 

P(X = x)= O x q n - x . 

Expected Value expected arithmetic average when an experiment is repeated many times; also called the mean. 
Notations: p. For a discrete random variable (RV) with probability distribution function P(x),the definition can also 

be written in the form p = ^ xP(x). 

Geometric Distribution a discrete random variable (RV) that arises from the Bernoulli trials; the trials are repeated 
until the first success. The geometric variable X is defined as the number of trials until the first success. Notation: X 

~ G(p). The mean is p = and the standard deviation is cr = ]j^p — 1 ) . The probability of exacdy x failures 
before the first success is given by the formula: P(X = x) = p(l - p) x ~ 1 . 

Geometric Experiment a statistical experiment with the following properties: 

1. There are one or more Bernoulli trials with all failures except the last one, which is a success. 

2. In theory, the number of trials could go on forever. There must be at least one trial. 

3. The probability, p, of a success and the probability, q, of a failure do not change from trial to trial. 

Hypergeometric Experiment a statistical experiment with the following properties: 

1. You take samples from two groups. 

2. You are concerned with a group of interest, called the first group. 

3. You sample without replacement from the combined groups. 

4. Each pick is not independent, since sampling is without replacement. 

5. You are not dealing with Bernoulli Trials. 

Hypergeometric Probability a discrete random variable (RV) that is characterized by: 

1. A fixed number of trials. 

2. The probability of success is not the same from trial to trial. 

We sample from two groups of items when we are interested in only one group. X is defined as the number of 
successes out of the total number of items chosen. Notation: X ~ H(r, b, n), where r = the number of items in the 
group of interest, b = the number of items in the group not of interest, and n = the number of items chosen. 

Mean a number that measures the central tendency; a common name for mean is ‘average.’ The term ‘mean’ is a 
shortened form of ‘arithmetic mean.’ By definition, the mean for a sample (detonated by x ) is 
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_ _ Sum of all values in the sample ^ ^ 

Number of values in the sample 

Sum of all values in the population 

Number of values in the population 



the mean for a population (denoted by p) is p = 



Mean of a Probability Distribution the long-term average of many trials of a statistical experiment 

Poisson Probability Distribution a discrete random variable (RV) that counts the number of times a certain event 
will occur in a specific interval; characteristics of the variable: 

• The probability that the event occurs in a given interval is the same for all intervals. 

• The events occur with a known mean and independently of the time since the last event. 

The distribution is defined by the mean p of the event in the interval. Notation: X ~ P(p). The mean is p = np. The 

standard deviation is a = \fji . The probability of having exactly x successes in r trials is P(X = x ) = (e . 

The Poisson distribution is often used to approximate the binomial distribution, when n is “large” and p is “small” 
(a general rule is that n should be greater than or equal to 20 and p should be less than or equal to 0.05). 

Probability Distribution Function (PDF) a mathematical description of a discrete random variable (RV), given 
either in the form of an equation (formula) or in the form of a table listing all the possible outcomes of an 
experiment and the probability associated with each outcome. 



Random Variable (RV) a characteristic of interest in a population being studied; common notation for variables are 
upper case Latin letters X, Y, Z,...; common notation for a specific value from the domain (set of all possible values 
of a variable) are lower case Latin letters x, y, and z. For example, if X is the number of children in a family, then x 
represents a specific integer 0, 1, 2, 3,.... Variables in statistics differ from variables in intermediate algebra in the 
two following ways. 

• The domain of the random variable (RV) is not necessarily a numerical set; the domain may be expressed in 
words; for example, if X = hair color then the domain is {black, blond, gray, green, orange}. 

• We can tell what specific value x the random variable X takes only after performing the experiment. 

Standard Deviation of a Probability Distribution a number that measures how far the outcomes of a statistical 
experiment are from the mean of the distribution 



The Law of Large Numbers As the number of trials in a probability experiment increases, the difference between the 
theoretical probability of an event and the relative frequency probability approaches zero. 



CHAPTER REVIEW 



4.1 Probability Distribution Function (PDF) for a Discrete Random Variable 

The characteristics of a probability distribution function (PDF) for a discrete random variable are as follows: 

1. Each probability is between zero and one, inclusive ( inclusive means to include zero and one). 

2. The sum of the probabilities is one. 

4.2 Mean or Expected Value and Standard Deviation 

The expected value, or mean, of a discrete random variable predicts the long-term results of a statistical experiment that has 
been repeated many times. The standard deviation of a probability distribution is used to measure the variability of possible 
outcomes. 

4.3 Binomial Distribution 

A statistical experiment can be classified as a binomial experiment if the following conditions are met: 

1. There are a fixed number of trials, n. 

2. There are only two possible outcomes, called "success" and, "failure" for each trial. The letter p denotes the probability 
of a success on one trial and q denotes the probability of a failure on one trial. 
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3. The n trials are independent and are repeated using identical conditions. 

The outcomes of a binomial experiment fit a binomial probability distribution. The random variable X = the number of 
successes obtained in the n independent trials. The mean of X can be calculated using the formula p = np, and the standard 
deviation is given by the formula a = \jnpq . 

4.4 Geometric Distribution 

There are three characteristics of a geometric experiment: 

1. There are one or more Bernoulli trials with all failures except the last one, which is a success. 

2. In theory, the number of trials could go on forever. There must be at least one trial. 

3. The probability, p, of a success and the probability, q, of a failure are the same for each trial. 

In a geometric experiment, define the discrete random variable X as the number of independent trials until the first success. 
We say that X has a geometric distribution and write X ~ G(p) where p is the probability of success in a single trial. 



4.5 Hypergeometric Distribution 

A hypergeometric experiment is a statistical experiment with the following properties: 

1. You take samples from two groups. 

2. You are concerned with a group of interest, called the first group. 

3. You sample without replacement from the combined groups. 

4. Each pick is not independent, since sampling is without replacement. 

5. You are not dealing with Bernoulli Trials. 

The outcomes of a hypergeometric experiment fit a hypergeometric probability distribution. The random variable X = the 
number of items from the group of interest. The distribution of X is denoted X ~ H(r, b, n), where r = the size of the group 
of interest (first group), b = the size of the second group, and n = the size of the chosen sample, ft follows that 



a fixed interval of time or space, if these events happen at a known average rate and independently of the time since the last 
event. The Poisson distribution may be used to approximate the binomial, if the probability of success is "small" (less than 
or equal to 0.05) and the number of trials is "large" (greater than or equal to 20). 




n<r + b. The mean of X is p = 1,1 , and the standard deviation is <7 = 

F r + b 




4.6 Poisson Distribution 



A Poisson probability distribution of a discrete random variable gives the probability of a number of events occurring in 



FORMULA REVIEW 



Mean or Expected Value: p — ^ xP(x) 

xsX 



4.2 Mean or Expected Value and Standard 
Deviation 



4.3 Binomial Distribution 




X ~ B(n, p) means that the discrete random variable X 
has a binomial probability distribution with n trials and 
probability of success p. 

X = the number of successes in n independent trials 
n = the number of independent trials 
X takes on the values x = 0, 1, 2, 3, ..., n 
p = the probability of a success for any trial 
q = the probability of a failure for any trial 

p + q = 1 
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